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Abstract 

The 0(n) nonlinear sigma model in 1 + 1 dimensions is examined as quantum 
mechanics on an infinite-dimensional configuration space. Two metrics are defined in 
this space. One of these metrics is the same as Feynman's distance, but we show his 
conclusions concerning potential energy versus distance from the classical vacuum are 
incorrect. The potential-energy functional is found to have barriers; the configurations 
on these barriers are solitons of an associated sigma model with an external source. 
The tunneling amplitude is computed for the 0(2) model and soliton condensation is 
shown to drive the phase transition at a critical coupling. We find the tunneling paths 
in the configuration space of the 0(3) model and argue that these are responsible for 
the mass gap at 9 = 0. These tunneling paths have half-integer topological charge, 
supporting the conjecture due to Affleck and Haldane that there is a massless phase at 
weak coupling and 9 = n. 
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1 Introduction 



The two-dimensional 0(3) nonlinear sigma model has been regarded as interesting 
by elementary-particle physicists because of the features it shares with QCD, such as 
asymptotic freedom |] and instantons 0. Condensed-matter physicists have studied 
the model in connection with antiferromagnetic spin chains ||, the quantum hall effect 
|4[] and (more traditionally) classical ferromagnets. The 0(n) model can be studied by 
the 1/n expansion and the 0(2) model can be understood a Coulomb gas of vortices 
||. The spectrum and S-matrix of the 0(3) model can be determined exactly 0, @. 
Haldane argued that at 9 = ir the model should be equivalent to the half-integer XXX 
spin chain and is therefore gap less [[|. 

In spite of these successes, there are still unresolved issues. The spin-chain argu- 
ments, though extremely compelling, need to be supplemented by other evidence, since 
the "Haldane mapping" || is not strictly a mapping at all, but relies on some assump- 
tions about local properties of the ground state. Indeed, naively one might conclude 
from the spin-chain arguments that any 0(3) invariant, integer-spin antiferromagnetic 
chain has a gap; yet some integer-spin chains are gapless ||. 

In support of Haldane's conclusion Affleck and Haldane || suggested that conden- 
sation of merons [IDf, objects with half- integer topological charge, was responsible for 



the gap at 9 = 0, while at 9 = n and small bare coupling the merons bind in pairs, 
rendering the spectrum gapless. These arguments were not precise for the 0(3) sigma 
model, but were made by a conjectured extrapolation from a model with 0(3) broken 
to 0(2). Nonetheless, numerical work of Bietenholz et. al. with the symmetry unbro- 
ken suggests that this picture is valid |Tl[. Other observations concerning the role of 



half-integer topological charge were made recently in reference [12 



In this paper we study the 0(2) and 0(3) nonlinear sigma models using some ideas 
developed for non-Abelian gauge theories [13[]. Some clarification of these ideas in 



reference ||13|| , as well as some preliminary work for the sigma model is presented in 



reference 14 



The starting point is the definition of distance functionals on the fields. We work 
with two such functionals. One of these is the natural distance related to the kinetic 
term. The other is defined on the orbit space, which consists of configurations modulo 
global rotations. The latter distance functional is implicit in the work of Feynman 



Tojj . Each of these functionals is a metric. The physical utility of these metrics are 
discussed. The second metric is used to define a coordinate on the orbit space. We then 
examine the minima of the potential energy for fixed values of this coordinate. What 
appears is a structure of energy barriers in configuration space. On these barriers, 
the configurations are identical to solitons - not of the ordinary sigma model, but of a 
related classical field theory. These solitons are topologically stable for the 0(2) model, 
but not for the 0(3) model. They can be written down explicitly in terms of elliptic and 
related functions. We show that the barriers of the 0(2) model can be penetrated only 
above a critical coupling. At this coupling a phase transition is driven by condensation 
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of solitons. We thereby obtain an entirely quantum-mechanical explanation of the 
transition, complementing the standard statistical-mechanical explanation The 
barriers of the 0(3) sigma model are considerably more complicated than those of the 
0(2) model. We find these and examine them in some detail. We then show that 
the barrier-penetration process has topological charge one half; a result which strongly 
supports the arguments of Affleck and Haldane |§. 

Our Hamiltonian analysis is quite different from the usual path-integral saddle- 
point approach to the sigma model. The viewpoint is one almost never considered for 



the model, except in the appendix in Feynman's paper [15|. As the style of his article 



is difficult to follow, few physicists have pursued the ideas further. We therefore see the 
need for a nontechnical summary of our motivation and results, which is the content 
of the next section. 



2 Distance and potential-energy topography 

Field theories are infinite-dimensional quantum systems. In the Schrodinger represen- 
tation, wave functionals are mappings from a field configuration space to the complex 
plane. The elements of this space, for the time being denoted as tp, are functions from 
the space manifold into some target manifold. In general, there is a Hamiltonian 

H = e T+-V[p] , 

where the kinetic term T is an operator which does not commute with ip and eo is 
what we will call the bare coupling constant. In fact, eo is not the standard coupling 
of some field theories, but we present the Hamiltonian in this form anyway (since it 
will be convenient in our analysis of the sigma model). Some regularization is always 
present in our discussion. 

To be clear as to what we mean by potential energy, the potential-energy operator 
V[(p] contains gradient terms. For example, a scalar theory has the potential energy 

V[<p] = jd D x{ l -[Vp b (x)f + V( Vb (x))}, 

in dimension D + 1, where the species are labeled by b and V is a local potential 
function. 

There is a natural notion of a distance between any pair of configurations. There is 
nothing vague about this idea as we will attempt to show. Suppose the spectrum is to 
be calculated with a strong-coupling expansion (say on a space lattice with continuous 
time). Such an approximation proceeds by first diagonalizing T, then treating V as 
a perturbation. One way to proceed towards solving for the spectrum is through the 
representation of the evolution operator through the Lie- Trotter product formula: 

U(t) = e lHt = lim ( e ^oTe e iVe/e \ N |^ =f ^ 
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which is then expanded in powers of V. Such an approximation is very crude and 
requires resummation techniques to be made of practical value; but such details, though 
mathematically detailed, are not conceptually sophisticated. In any case, at leading 
order, one expects rather generally that the matrix element of the exponential of the 
kinetic term between two states localized in configuration space, is of the form 

< ^i|e ieoT > 2 >~ (^^exp^pKH 2 , (2.1) 
zm e 2e 

where M is the (ultraviolet regulated) number of degrees of freedom. The function 
of two configurations p[<fi, ^2] can be identified as a distance. The expression ( |2.1| ) 
is difficult to evaluate exactly, and it is reasonable to expect that substitution of a 
function p'[(fi, ^2] in place of p[ip%, ip%] will make no difference as e — > 0, provided that 
this choice satisfies the criterion that as p[<pi, ^2] — * 

p'bi,^] - p[<fi,<f 2 } _^ 

P[<Pl,<f>2] 

In other words, as either distance vanishes, the two distances become indistinguishable. 
This indistinguishability has a geometric interpretation. It guarantees uniqueness of 
the infinitesimal metric, which on physical grounds should have the Riemannian form: 

dp 2 = p[cp, Lp + Sip] 2 = J d D x J d D y A) {y ^ B) 8(p A (x) 5<p B {y) , 

where the symbols A and B include species, Lorentz indices, etc. However, we will 
show that the general distance function p[-, ■} has its uses. 

Finding the distance is easy for the example of a scalar field theory. The kinetic 
term is 

T = -l [ d D x 



2 J 8ifb(x)8(fb(x)* ' 
A distance which satisfies ( |2.1|) is given by the Pythagorean expression 



p[<fi,f2Y = \\ d ° x WbiW - <Pb2{x)\ 2 • (2.2) 

For a gauge theory in temporal gauge, the physical degrees of freedom are not con- 
nections Ai(x) (where % — 1, . . . D is a space-coordinate index) in some Lie algebra, but 
orbits. These are equivalence classes of connections related by gauge transformations. 
The set whose elements are Af = g^A^g + ig~ 1 dig is an orbit a. A choice of distance 
between two orbits a, j3 is 

p[a, (3} 2 = inf ~ / d D x tr [A(x) - B^x)] 2 . (2.3) 
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This expression was considered by Babelon and Viallet [Tj| and later (and somewhat 
implicitly) by Feynman |]15| . It was proven to be a metric in the continuum (provided 
care is taken as to the mathematical details) in reference |T3| and a lattice analogue 



was discussed in reference [jTJ]]. Since (|2.3|) depends only on the gauge orbits a and 



(3 and not on the specific connections in those orbits, it is gauge invariant. Given 
any metric space, there is a second metric function which can be defined, called the 
intrinsic metric ||17|| . This latter metric is the length of the shortest path between two 
configurations (a consequence of the triangle inequality is that the intrinsic metric is 
always an upper bound on the original metric) . For gauge theories, the intrinsic metric 
coincides with the distance ( p73]) fT3|l , as had been conjectured earlier by Babelon and 
Viallet 0]. 



Once a distance satisfying (|2.1|) is found, other issues can be addressed. The po- 
tential energy can be thought of as a "height function" on the configuration space. We 
would like to use our intuition about topography, e.g. the hills, saddles and valleys of 
the height function on configuration space, to understand the properties of the spec- 
trum of the quantum theory. We give some examples of the application of this intuition 
below. 

Our first example is a one-component scalar field with a convex potential V((fi). In 
such a situation, the extension of the ground-state wave functional from the origin of 
configuration space (<p = 0) is controlled by the potential. Even if the (regularized) 
theory has a vanishing bare mass a gap will appear in the spectrum. Whether the 
interaction terms in the potential can survive as the cut-off is removed (and whether 
the renormalized theory has a gap) depends upon the anomalous dimensions of these 
terms. 

For our second example, we shall give a highly nonrigorous proof of Goldstone's 
theorem. Suppose, for a multi-component scalar theory there are inequivalent, con- 
tinuously parametrizable, degenerate minima of the potential, characterized by an 
expectation value v b = (ip h ) ^ 0. When the volume is finite, the true vacuum is non- 
degenerate. Let us consider the distance between two constant low-potential-energy 
configurations 

<fbi{x)=v b , (fb2(x)=v' b , \v\ 2 = \v'\ 2 . 

Then (12.21) becomes 



P&uVzf = j\v b ~ v' b \ 2 < KV . 

where V is the volume of space and if is a constant. The upper bound on quantity 
diverges in the thermodynamic limit. In quantum mechanics, a vanishing potential 
energy on a domain of diameter V KV implies a gap between the (nondegenerate) 
ground state and the first excited state of order 1/y/V. This gap vanishes and, by 
Lorentz invariance, the spectrum is completely continuous as V — > oo. 
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Actually the argument of the previous paragraph is a bit too crude. There are 
situations in which breaking of a continuous symmetry does not occur, despite the fact 
that the distance between pairs of low-potential-energy configurations is unbounded in 
the thermodynamic limit. In particular, when D = 1, a state localized near a given 
value of <f has low potential energy, but quantum fluctuations raise the expectation 
value of the kinetic energy. This latter energy is lowered through the formation of 
domain walls (these are point-like objects in one space dimension) between regions of 
different values of Vb- These domain walls cost almost no potential energy, as will be 
shown explicitly for the one-dimensional sigma models in sections 7 and 12. Their 
potential energy turns out to be of order 1/L, where L is the one-dimensional volume. 
Since they also lower the expectation value of the kinetic energy, they condense in 
the vacuum. We caution the reader that we use the term "domain wall" somewhat 
loosely and only for lack of a better name (more accurate but more cumbersome is 
"low-energy nonlinear wave"). What we are calling domain walls are not topologically 
stable objects, as are the domain walls of the Ising model, though we will show that 
they can be thought of as solitons. When the space dimension is two, a domain wall 
is a one-dimensional object in space. It now has a potential energy proportional to 
its length I, as well as inversely proportional to the diameter of two-dimensional space 
L (which in turn is proportional to the square root of the volume). This energy is 
therefore roughly ~ l/L. This energy will vanish in the infinite- volume limit only if 
the typical length I of a domain wall is not proportional to L. If it does not vanish, 
spontaneous breaking of a continuous symmetry is possible. 

We will now summarize the situation for our final examples, the 0(2) and 0(3) 
sigma models, for the benefit of the reader who would prefer to wade through the physi- 
cal ideas before plunging into the mathematics. We study the problem of minimization 
of the potential energy for fixed distance from a constant configuration (the classical 
vacuum). More precisely, we consider this problem using two different distances, one 
of which is the natural distance in the sense we have already discussed, as well as a 
second distance (due to Feynman [[TJJ) which is insensitive to global 0(n) transforma- 
tions. For either distance we find that the potential-energy surface has grooves or river 
valleys. The type of configuration on a river valley is specified by a parameter, or set of 
parameters (which are the invariants of an elliptic function ||18|| ). We now summarize 
how the configurations along a river valley appear as these parameters are tuned. The 
configuration begins as a constant or near constant. Then a weak, long- wavelength 
disturbance, or spin wave appears; this is nearly sinusoidal. Next the amplitude of the 
spin wave increases, but has nearly vanishing derivatives, except in a small region. This 
configuration is the domain wall mentioned in the previous paragraph. As the config- 
uration continues to move along the river valley, the domain wall narrows to a region 
of the size of the short- distance cut-off. Outside the domain wall, the configuration 
is nearly constant. At this stage the potential energy is enormous, and is only finite 
by virtue of the cut-off. Remarkably, the distance between this configuration and the 
constant configuration is infinitesimal, although the two configurations are separated 



5 



by a high, thin potential-energy barrier. At the top of the barrier, the configuration 
is not a domain wall at all (the value of the field on each side of this object is nearly 
the same!). We will call it the barrier configuration (this is probably similar to what 
Asorey and Falceto call a "sphaeleron" in the 0(3) model fi~2|l ). 

As we have already stated, the domain walls have nearly zero potential energy and 
always condense in the vacuum. Whether or not barrier configurations also condense 
depends on the dynamics of the particular sigma model. For the 0(2) model we com- 
pute the WKB tunneling amplitude through the barrier. We find that for a sufficiently 
large bare coupling constant, the barriers do condense. We show that tunneling through 
a barrier is a vortex ||. In this way, we obtain the first analytic demonstration of the 
phase transition within the Hamiltonian framework of this model. The 0(3) model is 
harder to deal with. However, we are able to find the explicit form of the river valleys. 
They sit together in a higher- dimensional subset of configuration space, which we call 
a river delta. Surprisingly, as the potential energy begins to grow, the river valleys in 
the river delta coalesce. At the top of the barrier, the configuration is similar to that 
of the 0(2) model. It is perhaps adequate to describe the barrier as 'Abelian". While 
we do not compute the WKB amplitude of penetrating the barrier, we do calculate the 
topological charge of the barrier-penetration process. It is one half. 



3 The sigma-model metrics 

We investigate two different choices of distance for the sigma model. Both of these are 
metrics, in that they satisfy the axioms for a metric space. The first metric is simpler 
and is easier to justify physically for the sigma model. We call it the physical metric. 
The second, invented by Feynman has the advantage of 0(n) global invariance. 



It is defined on pairs of points of orbit space, which will be defined below. While the 
second metric is no more calculationally powerful than the first and is less natural 
from the point of view of the sigma-model spectrum, we consider it for several reasons. 
First, Feynman claimed that a certain property is true of the potential-energy surface 
of the 0(3) sigma model using this metric. In fact, this is not correct and we feel it is 
necessary to explain some of the subtleties as to why not (this is done in section 11). 
Second and more important, Feynman's metric is nice from a mathematical point of 
view, and it is easier to visualize the potential energy on orbit space than on the space 
of field configurations. 

In this section and the two sections which follow we discuss some general properties 
of the metrics of the nonlinear sigma model in D space and one time dimension. We 
specify the space dimension D to be one in section 6. 

Let s(t, x) be a unit real n-vector-valued field on some D-dimensional space manifold 
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(not the space-time manifold, whose dimension is D + 1), written as 



/ si(t,x) \ 



s(t, x) 



\ s n (t,x) J 



with s(t, x) T s(t, x) = 1. 

The action of the 0(n) sigma model is 



S = — f dtd D x {d t s T d t s - V x s T ■ V x s) 



(3.1) 



Wherever a regularization is not explicit in our discussion, it will usually be implicit. 

The Hamiltonian is built from a similar unit- vector s(x), which is a c-number 
operator, as well as the angular-momentum operator 

L kl (x) = -i^ ka - k -s ka (x)'-' Skn _ 1 {x) <] 



$s kn (x) 



Explicitly, the Hamiltonian is 

H = I d D x[ ^-L T (x)L(x) + J-V x s(xf ■ V,s(x) 
The simplest distance one can define is the physical metric 



e T + —U 

e 



r[f, s] 



1 
2 



d u x [f(x) - s(x)} 1 [f(x) - s(x)} 



(3.2) 



this is the integral of the chord between two points on a sphere. It does not actually 
satisfy the usual properties of a metric space, unless one is willing to use a certain 
amount of real analysis. For example, it is necessary to identify two field configurations 
which are the same except on a set of measure zero (this is similar to how Hilbert- 
space vectors are constructed from Schrodinger wave functions in ordinary quantum 
mechanics). Alternatively, one can define the lattice metric over lattice points of space 
x: 



.d 



(3.3) 



where a is the lattice spacing. In any case, with the appropriate definition of ( |3.2D or 
with ( |3.3| ) it is easy to show the three metric properties for field configurations s, f 
and g: reflexivity 



r[f, s] = r[s,f] 
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positivity 



r[s,f]>0, (3.4) 

with equality holding only if s = f, and the triangle inequality 

r[s,f] +r[f,g] > r[s,g] . (3.5) 

Just as for gauge theories, one can define an orbit space for the sigma model. This 
is defined as a set of equivalence classes of field configurations. Two field configurations 
s and / are said to be equivalent if f(x) = R s(x), where R G 0(n) is a global rotation. 
This relation is obviously an equivalence relation. Each equivalence class is an orbit. If 
the set of unit vector fields is called S, the orbit space M. = S/0(n) is the set of orbits 

if) = { Rs(x) : R G 0{n) } . 

Two sigma-model fields s and / are equivalent if and only if there exists an orbit if) 
such that s G if) and / G if). 

The use of orbit space takes some justification. In the sigma model, unlike in 
gauge theories, not all states are singlets under the symmetry group. These states 
transform under some representation of this group (though if spontaneous symmetry 
breaking is absent, the ground state is a singlet, transforming under the trivial rep- 
resentation). There is however an obvious reason for studying the space Ai. The 
potential energy is invariant under 0(n) transformations. Thus as far as the spectrum 
of the potential-energy operator U is concerned, there is no physical difference between 
different elements of if). 

To better motivate the definition of Ai, we define a modified sigma model, which 
has a real antisymmetric n x n-matrix gauge field A(t) depending on time, but not on 
space. The Euclidean path integral is 

Z = Jvs(x,t)5(s T s-l) JvA(t)exp- J d D x J dt{—\[d t - A(t)]s(x,t)\ 2 
+ Vs(x,t) 2 } . (3.6) 

The gauge field has only a finite number (namely n(n — l)/2) of degrees of freedom. 
The path integral ( |3.6| ) is not Lorentz invariant. Nonetheless, the Green's functions 
are the same as those of the usual sigma model in the thermodynamic limit. For, 
by making a suitable gauge transformation A(t) — > g _1 (t)^4(t)g(t) — iq~ l {t)q{t) and 
s(x,t) — > q~ 1 (t)s(x, t), where q(t) is an orthogonal matrix, A can be set to zero for 
almost every time t. Thus ( |3.6|) can be reduced to the usual path integral except at a 
boundary chosen at some particular value of t. 

There is a lattice version of (|3.6j ). The continuum coordinates x,t will now be 
replaced by lattice coordinates. These will also be written as x, t, but are integers, 
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equal to the corresponding continuum coordinates divided by the lattice spacing a. 
The lattice path integral is 



JJ f d n s(x,t) 5(s(x,t) T s(x,t) 



X 



II / dR(t) 



,D-l 



exp 



2e o ^ 

+ ^2[s(x + i,t) — s(x, t)] T [s(x +i,t) — s(x, t)}} 



D 



^2{[s(x, t + 1)- R(t)s(x, t)f[s(x, t + 1)- R(t)s(x, t)] 

(3.7) 



8=1 



where the measure of integration over the lattice gauge field R is the 0(n) Haar mea- 
sure. Let us ask how the variable R(t) behaves if no gauge fixing is imposed. It is 
clear that in the semiclassical approximation, the configurations with R(t) chosen to 
minimize J2 x [ s ( x i t + 1) — R(t)s(x, t)] T [s(x, t + 1) — R(t)s(x, t)} will dominate. Suppose 
then that for some fixed choice of t and s(x,t), R(t) = R(t) is this minimum. Note 
that R is unique. This is because in the continuous-time limit R is the solution of 
a first-order ordinary differential equation. The fluctuations around R have a coeffi- 
cient proportional to the size of physical space. Consequently, these fluctuations are 
suppressed in the thermodynamic limit. Therefore ( |3.7|) can be replaced by 



JJ / d n s(x,t) 5(s(x,t) T s(x,t) - 1) 

x,t 



x exp 



t + 1) - R(t)s(x, t)} T [s(x, t+1)- R(t)s(x, t)] 



2e o ^ 

+ ^2\s(x + i,t) — s(x, t)] T [s(x + i,t) — s(x, t)}} , 

i 

Thus the path integral may be regarded as the Wick rotation of quantum mechanics 
of a particle with mass 1 in a space in which the distance is p[-, •], defined by 



P\<PM 2 = v „¥ t , E WW - s(x)f[Rf(x) - s(x)} 



(3i 



where f(x) G <f) and s(x) G ip. The classical kinetic energy of a time-dependent orbit 
is the arc-length of the curve this orbit "traces out" in orbit space. 

We will now show that on the lattice p[-, ■] is a metric on orbit space. The orbit 
space is therefore a metric space. There are three properties which must hold for this 
to be true. Obviously for any two orbits and ip, 



p[<t>M = pbP,<f>) 



(3.9) 
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It is straightforward to check that for these orbits 



p[<M>0, (3.10) 

with equality holding only if <fi — ip. Finally, suppose that <p, ip and r are orbits. Let 
f(x), s(x) and t(x) be unit vector fields on the lattice such that f(x) G (ft, s(x) G ip 
and g(x) G 7. The triangle inequality (|3.5| ) for field configurations implies 

E [WW - s(x)f[Rf(x) - s(x)} + £ [s(x) - R'g{x)} T [s{x) - R'g(x)} 

X X 

> E [Rf(x) - R'g(x)] T [Rf(x) - R'g(x)] , 

X 

for any R, R' in 0(n). Taking the greatest lower bound of the sum of first two terms 
over R and R' gives 

p[&V]+PfoM >P[<M • (3.11) 



Now that the three properties fl3.9|), ( ^.10| ) and ( ^.11| ) are proved, it follows that p[- 



is a metric and that orbit space is a metric space. 

The continuum limit of (|3.8| ), dropping factors of the lattice spacing, is 



p[0,^] 2 = inf ^ [d D x [Rf(x) - s(x)f[Rf(x) - s(x)} . (3.12) 

This expression coincides with the "minimal distance" for the sigma model Feynman 
attempts to estimate in the appendix to his article flI5| . 

Strictly speaking, in the continuum neither (|3.2j ) nor (|3.12j) is actually a metric, 



unless we are careful about configurations differing on sets of measure zero (otherwise, 
the inequalities ( |3.4|) and ( |3.10|) can be saturated by distinct configurations). If this 
issue is properly dealt with, the potential-energy function is infinitely discontinuous 
on the space of configurations (this will be explained later in this article). However, a 
regularization will make this discontinuity finite or can remove it altogether. 

4 Evaluation of Feynman's orbit space metric 

Unlike the case of the Yang-Mills metric, it is possible to evaluate (|3.2j) (and (|3.8| )) 
explicitly. Let us first rewrite ( |3.2| ) as 

p[<f>, = inf / d D x [1 - s(x) T Rf(x)] . (4.1) 
Define the matrix M by 

M kl = [ d D xf k (x) Sl (x) . 
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Then 



p[(j)^] 2 = V- sup trRM , 

ReO(n) 

where V is the volume of space. The 0(n)-covariant variation of R is R — > S'jR, 
where £7 is an infinitesimal antisymmetric matrix. The condition that trRM be a 
local extremum is therefore that RM is a real symmetric matrix, i.e. (RM) T = RM. 
Therefore, there exists an orthogonal matrix P which diagonalizes RM: 



RM = P T AP , A 



/ Ai 
A 2 



V 



\ 




Then tr RM = J2k Afc- In fact, the absolute maximum of the trace coincides with the 
supremum. For this case, all the diagonal entries in A are nonnegative. To see that 
this is so, assume the contrary. If is negative for a particular k between one and n, 
R can be replaced by another 0(n) matrix: 



/ 1 



R 



\ 



-1 



V 



R 



1 J 



where the —1 appears in the k th row and k th column. Such a transformation must 
increase the value of tr RM. Thus, if R is chosen to absolutely maximize the number 
tr RM, all the entries of A are positive definite. Now M T M = P T A 2 P, and the matrix 
RM has only positive eigenvalues. Therefore 



trRM = try/ M T M . 



(4.2) 



The meaning of ( |4.2| ) should be clear; the right-hand side is the sum of the square roots 
of the eigenvalues of the symmetric matrix M T M. Since this expression is unique, it 
must coincide with the supremum. Thus (EO) reduces to 



p[0, = V - tr VM T M = V -trJ d D x d D y f(x) [s(x) T s(y))f(y) T 



(4.3) 
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Having carefully derived ( |4.3|) , let us note that in retrospect it is obvious. It is the 
only positive-definite 0(n)-invariant expression with the correct dimensions satisfying 

There is no essential complication in taking the analysis above to the lattice. Then 
( [4.3D is replaced by 

p[0, <A] 2 = C - tr /££ f(xMx)Ts(y)]f(y)T , 
y x y 

where C is the total number of lattice sites. This expression is equal to ( |3.8| ) . 



5 The infinitesimal metrics 

In this section, we discuss the Riemannian form of our metrics for small separations. To 
do this rigorously, a lattice or alternatively Hilbert-space techniques must be used |13| . 



We leave it to the more mathematically oriented reader to fill in the gaps. This section 
is not necessary to understand the rest of the article. However, in the arguments and 
calculations which follow we often discuss curves and functions on configuration space. 
While these can be made well-defined concepts on general metric spaces IJT7J, the reader 



may feel more comfortable knowing that differential-geometric intuition applies as well. 
The functional Riemannian metric on configuration space is directly written down 



from ( 3.2 



where 5s satisfies 



dr 2 = - J d D x 5s T (x)5s(x) 



d°x Ss 1 (x)s(x) = . 



There is no more work to be done for this case. 

The functional Riemannian metric on orbit space has the unusual feature that the 
metric tensor has zero eigenvalues. The corresponding zero modes are the directions 
along symmetry transformations. In other words, these zero modes are motions along 
the fibers of the fiber bundle of field configurations. The reader can find a general 
discussion of such metric tensors in the appendix of reference ||13[| , though the essential 
idea of using a bilinear form with zero eigenvalues as an inner product was discussed 
much earlier |T3 . Since the sigma- model orbit space is not very difficult to understand, 



the standard of rigor in this section is considerably less than that used for the Yang- 



Mills orbit space [13 



Let us find the infinitesimal form of the metric on orbits and ip which are in- 
finitesimally close, i.e., ( [4.3[ ) is very small. Then there must exist representatives of <fi 
and ip, which we call / and s and which satisfy 

8 = f + 8f. 
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Given /, we can in principle find 5f by rotating s by an 0(n) transformation minimizing 

Since / and s = f+df are unit vectors, we can write 5f in terms of an antisymmetric 
tensor W of rank n — 2: 

Sf( x ) = e wi a -i»-i W jlj2 ... jn _ 2 (x) f^(x) . 

Global 0(n) transformations can be parametrized as R = e h , where h is a constant 
antisymmetric matrix, parametrized by a constant tensor b of rank n — 2 as 

iik _ mh-jn-ik j 

The metric evaluated on <p and t/> is 

dp 2 = pfa = min i | r/'^.r { e ^-J->*[ ,(-,'! - 6 J1J2 ... Jn _ 2 /(.r),} 2 

= mini | ^[W^^C^-ft^"^] 2 , (5.1) 

where square brackets around indices denote antisymmetrization. 

Carrying out the minimization with respect to b in ( |5.1D is straightforward. The 
result for the 0(2) sigma model is 

dp 2 = J d D x J d D y G a (x) a {y) 8a(x) 6a(y) , 

where 

X i 

G a (x)a(y) = ^ (x - y) - — , 

and the angle a(x) is defined by /i(x) = sina(a;) and f2(x) = cosa(x). Notice that 
this functional metric tensor G has no dependence on a, and is therefore flat. The zero 
modes are constant rotations of a(x). For the 0(3) model 

dp 2 = J ' d D x J d D y G si(x)sj(y) 6f i (x)5f j (y) , 

where 

I i 

G s i ix)s i iy ) = -S D (x-y)5 lj - ^e lkl f k (x)(B- 1 ) lm e jmr f r (y) , 

where the three-by-three matrix B is 

B kl =5 kl_^_J d D zfk{z) f l {z y 

It can be checked that the zero modes of the metric tensor are 0(3) rotations. The 
metric tensor considered as a matrix in function space is idempotent, G 2 = G. A 
general treatment of how the Laplacian and the curvature may be determined is given 
in the appendix to reference ]T3| . One can go further in the case of the 0(3) sigma model 
and write the metric tensor in angular coordinates instead of unit-vector coordinates, 
as was done above for the 0(2) case. 
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6 The potential-energy surface and the classical pen- 
dulum 



Let us denote the "pure gauge" or "classical vacuum" orbit containing constant so(x) = 
So by ipQ. Consider now the following problem for D = 1 with V — L, the length of 
one-dimensional space. For fixed p[ip,ipo\, extremize the potential energy 



/ dx 


ds(x) 
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dx 



s G ip . 



Let us parametrize s(x) using angles ^(x),..., by 

/ sin^i(x)... sin^ n „i(x) \ 
sin^i(x)... cos£ n _i(x 

six) 



V 



cos 



in the standard way. For 0(2) we will label £i as a and for 0(3) we will label £i 
and ^2 as £ and k, respectively. The problem will be considered with the periodic 
boundary condition s(x) = s(x + L), though more general boundary conditions will 



also be discussed for the 0(3) case (in reference |14| Neumann boundary conditions 
were considered). 

If s(x) is any representative of ip, then 



L — 



s(x)dx\ 



(6.1) 



from ( |4.3| ). The problem is therefore equivalent to the extremization of the functional 



A[s] 



L ds(x) T ds(x) 
u dx dx 



+ A 



s(x)dx\ — v 



(6.2) 



where v = L—p[ip, ipo} 2 and A is a Lagrange multiplier. Suppose that a solution has been 



found. Then by rotating this solution, we can make J s J 
and J L s n (x) 



0, for j 



l,...,n 



v. The solution for the extrema of Ufa] for fixed p = p[ijj , i/jq] (this 
defines a sphere whose center is the point ^o) is then described by the motion of 
an n-dimensional pendulum, which can be written in terms of elliptic functions and 
integrals. 

We can choose not to work on orbit space but instead asked the same question 
on configuration space. In that case we need to extremize the potential energy, while 
holding fixed the physical metric distance from a constant configuration sq: 



s(x) - s ] T [s(x) - s ] 



Sqs(x) + L 
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Such a constraint included with a Lagrange multiplier is once again the n-dimensional 
pendulum equation. We therefore see that the solutions we will obtain for either 
problem are the same. A solution of the problem in orbit space is a set of solutions in 
configuration space; the latter solutions are all the same up to global rotations. 

An n-dimensional pendulum is a massive particle on the sphere S n -± embedded 
in R n under the influence of a constant gravitational force. The coordinates are Sj, 
i = l,...,n with J2i = 1. For simplicity, this force can be taken to be along the 
n-axis. 



The significance of elliptic functions [18| in the n-dimensional pendulum problem 



can be understood physically. There is one variable describing the "height" of the 
particle, namely the n th coordinate, s n . We will explain in a moment why the evolution 
of s n is periodic, i.e. s n (t + T) = s n (t), where T is the period. The behavior of other 
variables describing the pendulum (if n > 2) is not, in general, periodic. However, 
the evolution of these variables must satisfy certain conservation laws, namely the 
conservation of (n — l){n — 2)/2 of the n(n — 1)/2 components of the angular-momentum 
tensor. The conserved components are kj = SiPj — piSj where % and j are less than n. 
The other variables influence s n only through these conserved components. In other 
words the equation of motion of s n can be written so that kj can be substituted for 
the other degrees of freedom. Any solution of this equation is obviously periodic. 

Now imagine turning the pendulum upside down; this means reversing the direction 
of the gravitational field. The evolution of s n is again periodic with a new period T' . 
A field reversal is equivalent to the Wick rotation t —*■ it. The equation of motion 
contains one term with a second time-derivative. Reversing the direction of the field is 
equivalent to changing the sign of this second time-derivative. This implies that s n (t) 
in the original pendulum (that is, before it was turned upside down) has an imaginary 
period iT' as well as the real period T. If this function has no singularities in the 
complex t-plane other than poles, it must be an elliptic function. 



7 River Valleys of the 0(2) model and solitons 

Next we shall investigate the potential-energy surface of the 0(2) nonlinear sigma 
model or classical XY model. More precisely, we shall examine the pendulum solu- 
tions which are the extrema of ( |6.2| ). Depending upon which is easier, sometimes we 
will discuss the extrema in orbit space and sometimes we will discuss the extrema in 
configuration space. As we pointed out in the previous section, the latter are contained 
in the former. 

The planar pendulum displays two types of motion, namely oscillating and circulat- 
ing. The lowest-energy configuration is that for which the pendulum sits at the nadir 
for all time. As the energy is increased, the pendulum makes small harmonic oscilla- 
tions around the nadir, which can be described by trigonometric functions. Increasing 
the energy further leads to an increase in the period, which is an elliptic integral, even- 
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tually leading to a situation in which the pendulum will spend most of the time near the 
zenith, but on occasion will rapidly sweep out an angle of nearly 2n. A slight further 
increase in energy causes the pendulum to pass through the zenith, leading to circulat- 
ing, rather than oscillating motion. The pendulum still spends most of its time near 
the zenith, but the motion is now consistently clockwise or counter-clockwise. As the 
energy continues to increase the angular frequency of this circulating motion becomes 
a constant. Once more the motion can be described by trigonometric functions. 

The mathematical translation of the discussion in the previous paragraph is the 
following. For the pendulum Lagrangian 



L = -a 2 — — cos a) 



the classical energy is given by 

Epend 



a + — cos a) 



There are oscillating solutions to the equations of motion 

a(t) = 2 sin -1 k sn{ky/Jit^ k) , 

where sn(u, k), sometimes written snu, is the Jacobi elliptic-sine function, and E pen( i = 
2\xk 2 . The modulus k is between zero and one. Increasing the energy means increasing 
k. The period is 4K, where K is the complete elliptic integral K(k) = sn~ l l. This 
diverges logarithmically as k approaches one (asymptotically, K ~ log4/fc', where 
{k') 2 = 1 — k 2 ). At k = 1, these solutions are joined onto the k = 1 circulating 
solutions 

a(t) = 2 sin -1 |sn(/c _1 ^[Jlt, k) , | 

where E pen d = 2/i/fc 2 . The reason for the absolute value in this expression is that there 
are no turning points and the left-hand side is discontinuous with a discontinuity of 
2tt. As k is then decreased the energy increases further, the angular velocity eventually 
becoming a constant. 

Now let us to turn to the extrema of ( p-2\ ), where time t is replaced by space x 
(up to a constant) and kinetic energy (up to another constant) is now the sigma-model 
potential-energy density. We take periodic boundary conditions. Up to global rotations 
R there are two types of solution labeled by an integer N = 1, 2,... . 



a(x) = ±a°^ c (x, k, x ) 



±2 sin 1 k sn 



ANK 



(X - Xq) 



which resemble the oscillating solutions of the pendulum and 

-2NK , 



a(x) = ±a c ^ rc (x, k, x ) = ±2 sin 1 \sn 



[x - x ) 



(7.1) 



(7.2) 
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Circulating 




Figure 1: Two typical extremal curves for the 0(2) sigma model. In the thermodynamic 
limit, the potential energy is vanishing along the solid (oscillating) and dashed (circu- 
lating) curves where k is not close to one. The potential energy is ultraviolet divergent 
in the region where k is close to one. 



which resemble the circulating solutions of the pendulum. These extremal curves in 
orbit space are nicely parametrized by the modulus k as shown in Figure 1. Note: not 
all of these curves are the river valleys mentioned in section two. 

We will next investigate the properties of each branch of solutions. First let us 
consider the oscillating branch ( |7. 1|) . We denote the particular value of k where E = 2K 
by k*, where 

E = E(k) = f 1 dn 2 udu 
Jo 

is another standard elliptic integral, not to be confused with the energy E pen d. The 
number k* has the numerical value k* « 0.82. For < k < k* the distance from ipo is 
independent of N and has the form 

p osc (kf = L-\ j\osa% c dx\ =2l(i-—) , 
while for k* < k < 1 

p osc (kf =L- \ [ L cos a°^dx\ = 2L% . (7.3) 
jo K 

The function p(k) rises smoothly from to L as k goes from to k*, then falls off to 
zero again as k — > 1. Notice that at k — 1 the orbit is identical to that at k — 0. 
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An orbit along a extremal curve is maximally far from the origin at k = k*. The 
potential-energy functional U — \ J L (^) 2 <ix for the oscillating branch ( [7.1| ) is 

LJ 

where k' 2 = 1 — k 2 . For fixed volume L, U osc (k) diverges at k = 1, but, as mentioned 
earlier, this divergence is regularized by a lattice (or some other ultraviolet cut-off). 

Physically the oscillating branch of solutions ( |7.1|) is a spin wave of wavelength L/N. 
As k — > 0, the spin wave dies off in amplitude, approaching ipo- As k — > 1, the spin 
wave begins to wind. If the angle a is represented on a circle, then the configuration 
(up to global rotations) is a curve on a cylinder. This is shown in Figure 2abc. However 
the k = 1 limit is not a simple kink of winding number N. For not only is the curve 
in Figure 2c beginning to wind, but the amplitude is nearly a constant for all x. In 
other words, the spin wave has become a domain wall of the type discussed in section 
2. The potential energy of the domain walls, like those of the spin waves is very small. 
The winding takes place in narrow regions of space. As k approaches one, the width 
of these regions collapses to zero. Thus the k = 1 solution coincides with the k = 
solution. Since the potential energy diverges as k approaches one, we can see that 
there is an infinite discontinuity in the potential-energy function on orbit space. This 
discontinuity can be removed or made finite with an ultraviolet cut-off. 

For the circulating branch the distance from i/jq is given by 

p circ (kf = L- \ [ L cosa% rc dx\ = 2 -^-(%- k' 2 



k 2 \K 

while the potential energy is 

L 

Again there is an ultraviolet divergence at k — 1. 

The circulating branch of solutions wind around the cylinder (as shown in Figure 
2def) N times. In other words, these are configurations with iV kinks. As k — > 1, the 
potential energy diverges, but the orbit approaches ip . This is because the regions 
where the kinks occur will narrow until they disappear. As k decreases, the potential 
energy decreases as well, but p circ (k) becomes of order \f~L. What happens is that the 
derivative of a becomes of order ^ in this limit. 

Not all of the extremal configurations we have constructed are minima of the po- 
tential energy on a sphere in orbit space. This issue is examined in detail in Appendix 
A. We find that all the circulating solutions ( [7.2| ) are indeed local minima on the 
orbit-space sphere, for any value of k. Consequently, these are river valleys (note: in 
reference \TA\ the term "river valley" was used for all the extremal curves). We also 



show that the N = 1 oscillating solution (|7.1| ) is a local minimum on the orbit-space 
sphere for sufficiently small k. 
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Figure 2: How extremal- curve configurations depend on k. Here we show a path in 
configuration space, starting from ipo, moving away as an oscillating configuration, 
reaching ip again, then moving away as a circulating configuration. In each case, the 
horizontal axis of the cylinder is x, while the angle is a. For the oscillating solutions, 
small k implies a small amplitude spin wave (a), and as k increases the potential energy 
becomes more localized into a domain wall (b). When k is close to one, short kinks begin 
to form (c), while away from these kinks, the configuration is nearly constant. For the 
circulating solutions with k close to one (d), the kinks are fully formed and the winding 
number is N (the cylinder in this figure is too short to show the remaining kinks). 
Decreasing k does not change the number of windings, but the kinks begin to spread out 
into domain walls (e). As k decreases further, the derivatives become of order 1/L for 
all x. If k ss 1 (c and d) the potential energy becomes divergent; otherwise it vanishes 
as L — > oo. 

We will now interpret our results thus far. We have found that the potential energy 
on orbit space U[ip] has one-dimensional valleys. In the thermodynamic limit L — > oo, 
the bottoms of these valleys are flat (the potential energy vanishes there) except for 
the special region where k approaches unity. The k = 1 point is actually ipo- The 
energy of a point in a river valley diverges as ipo is approached. Therefore the potential 
energy at ip is discontinuous. If a regularization is introduced, the energy vanishes at 
■00) but for k = 1 — e, where e is determined by the ultraviolet cut-off, the river- valley 
potential energy rapidly rises to a large number. Even in this region, the energy must 
be a local minimum in the direction perpendicular to the river valley. 

For large volume, the potential energy is almost constant nearly everywhere in 
a river valley. The one-dimensional domain where this is so has length 0{\fZ). If 
we naively view A; as a collective variable, and ignore fluctuations in other degrees of 
freedom, the gap is of order 0(j). 
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We note that the extremal curves are not straight lines in configuration space. Their 
tangent vectors at k for the oscillating solution ( |7.1| ) are 

R osc lrr h x _ da T °( x i k ) _ Zsnudnu- Z(u)cnu 

p ww- dk - 1 _ k 2 > y< A ) 

where u = 4K(x — Xq) and Z(u) is the Jacobi zeta function. The inner product 
of and its derivative with respect to k is not zero, which means that the ex- 
tremal curves have curvature. One can define the unit tangent vector /3(x,k) = 

P(x,k)/^J L p(y,kydy. 

All extremal-curve configurations, including those of river valleys, are related to 
solitons in a finite volume. Consider adding to the 0(2) sigma model action (|3.1|) 
an external source h (from the viewpoint of lattice spin systems, this is an external 
magnetic field), modifying it to 

S = — [dtdx{ d t s T d t s - d x s T d x s + h T s) . (7.5) 
2eo J 

This is the sine-Gordon action, with the different classical vacuua a(x) = 0, ±2ir, . . . 
identified (in this respect it is closer to the textbook example of the twisting band than 
the usual sine-Gordon action). Without loss of generality, the direction of h can be 
chosen so that (17.51) reduces to 



S = - — J dtdx {{d t a) 2 — (d x a) 2 + h cos a] , 

and the equation of motion is the sine-Gordon equation, in the compact field a: 

(d 2 t -3%)a = -hsina . (7.6) 
Substituting a(t, x) = a(u), where u — t ± - into ( [7.6| ) yields 

v 2 h 



d 2 a(u) 



v 2 -V 



which is the equation of motion of the planar pendulum. The solutions to this equation 
are a train of evenly-spaced solitons, of the form 

« = 2sm ksn\ — - — , k ] , (7.7) 



or 



a — 2sm \sn\ — - — , k \ \ , [7.1 
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where the modulus k is a free parameter between zero and one and the spacing between 
solitons is given by 



S32K 2 (v 2 - I) 



' v 2 h 

The solitons are located at the nodes u = 0, ±Z, ±21,... . The k — * 1 limit of either 
( |7.7|) or (|7.8|) reduces to the standard one-soliton solution. 

Unlike the solutions ( [7.7| ) and ( [7.8| ), the extremal-curve configurations ( |7. 1| ) and 
( |7.2|) have no explicit time dependence. However, they do contain the translation 



parameter x$. Thus the soliton solutions of ( [7.5[ ) really can be thought of as extremal- 
curve configurations with moving xq. Conversely, a river- valley configuration is just a 
"snapshot" of a soliton configuration (|7.8|) at a given time. The solutions (|7. 1|) contain 
2N solitons (actually they are solitons alternating with anti-solitons) and ( |7.2| ) contain 
iV solitons. 

The reason we emphasize the mathematical resemblance between extremal-curve 
configurations and solitons is to give a physical picture of quantum barrier penetra- 
tion, which is discussed in the next section. The river-valley approaches the trivial 



configuration ip as k tends to one. Thus a soliton configuration (^8), with diverging 
potential energy can be made arbitrarily close to ipo (meaning that the metric separa- 
tion in configuration space between the orbit containing the soliton configuration and 
ipo is small). 



8 Barrier penetration and vortices 

We have gone to a lot of trouble to investigate the extremal curves, and if the reader 
has been patient enough to follow our discussion thus far, we will now explain their 
physical significance. 

An orbit in an extremal curve has very small potential energy, unless it approaches 
ijj from particular directions, corresponding to k close to one. For small k superposi- 
tions of extremal-curve configurations, 

a = ai«i + a2«2 + • • • , (8-1) 

where the real constants a\, 02,... are not too large, are also configurations of small 
potential energy. There is nothing surprising in this fact. For small k, Jacobi elliptic 
functions become trigonometric functions and flS.l|) is a general spin-wave configura- 
tion. For small amplitude waves, the sigma model is approximately a free field theory. 
However, when k is large, the extremal-curve configurations cannot be superposed as 
in ( |8.1| ), because their nonlinear character is important. 

We now make the following 
Assertion: In the Schrodinger representation of the XY model, the vacuum wave 
functional can be significantly different from zero for only two kinds of field configura- 
tions. These are: 
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1. Spin waves, i.e. configurations of the form ( |8.1| ). 

2. River-valley configurations with k very close to one, as well as slight deformations 
of such configurations. 

By "slight deformations" we mean two things: 

• First, small- amplitude spin waves can be added to large k river- valley configura- 
tions without significantly changing the potential energy 

• Second, river-valley configurations are very different from zero only in small re- 
gions of space where kinks begin to appear (we have shown in the last section 
that these kinks can be thought of as solitons). There is no significant gain in 
either the potential energy or in the distance from ipo if the position of these 
regions is changed. 

To see why the assertion must be true, let us first examine each case 1. and 2. 
individually. Clearly configurations of type 1. which have small potential energy must 
exist as fluctuations around ipo. There is nothing to suppress such fluctuations. Why 
should configurations of type 2. be important? They should be highly suppressed by 
virtue of their huge potential energy. However, they may not be completely suppressed. 
The reason is just that the orbits containing these configurations are close to i/)o i n orbit 
space. Even if quantum fluctuations from to one of these orbits may be small, there 
is the possibility that they do not disappear entirely. Only the configurations 1. and 
2. are either of small potential energy or close to i/jq. Therefore the vacuum wave 
functional should be vanishly small when evaluated for any configuration other than 
1. and 2. 

Whether or not configurations of type 2. indeed appear through quantum fluctu- 
ations depends upon the barrier-penetration amplitude. It is the computation of this 
amplitude we examine in the next section. However, first we will discuss further the 
physical interpretation of the barrier penetration. The energy barriers are more than 
just the k ~ 1 region of river valleys (|7.2|) . Consider the point of intersection ip of 
a sphere whose center is ipo and a river valley. Let G be a small neighborhood of ip 
in the sphere (see Figure 3). If this neighborhood G is sufficiently small, the point of 
intersection ip is actually a minimum of potential energy in G. The k « 1 part of a 
river valley is a path of least resistance through the barrier. 

A typical tunneling event can be visualized with the aid of either Figures 2abc or 
2def. A constant configuration makes a transition to the configuration 2c (2d), which 
contains short localized regions of large potential energy. These regions stretch out in 
physical space, meaning that the configuration evolves first to 2b (2e), then 2a (2f) 
and finally becomes nearly asymptotically constant (winding with minimal derivatives 
with respect to i). 

The amount of time required to make the initial transition to 2c (2d) depends on 
the coupling e$ and the nature of the cut-off. In a sensible regularization, the potential 
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river valley 



Figure 3: River valleys for k « 1 are paths of lowest potential energy through the ultra- 
violet divergent energy barriers. The potential at ip is a minimum in the neighborhood 
G in the sphere of constant metric distance from ip . 

energy in the small region 1 > k > 1* can be set to zero, where 1* is a number very 
close to one but strictly less than one. The energy barrier is thereby rendered finite. 
If the initial state is a delta function localized in ip , i.e. k = 1, the orbit will lie in 
a spreading wave packet, which will eventually reach 1*. Now e^ 1 plays the role of 
mass in this quantum-mechanical system. The form of this wave packet is therefore, 
for short times, 



The separation of these two points in orbit space is Therefore, the typical time 

an orbit takes to travel from k = 1 to k = 1* is proportional to eop(l*) 2 . For small Cq 
the transition is very sudden, and the tunneling process (shown on a lattice) resembles 
that of Figure 4. At stronger coupling, this transition time is large, but still finite. 

The tunneling process is a vortex ||. The closed line integral of the gradient of a 
enclosing the "core" of the vortex / V# • dl, where the topology of the one-dimensional 
configuration changes, is 2tc. From Figure 4, the reader can see that our vortex is 
asymmetrical, unlike Berezinskii-Kosterlitz-Thouless vortices, which are nearly rota- 
tional invariant (Figure 5). In the tunneling process corresponding to the latter, the 
topology change does not happen suddenly, even for small e^. The two types of vortices 
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Figure 4: The tunneling process at weak coupling. Time is represented by the vertical 
direction, and a lattice is used for ease of visualization. The initial configuration is 
constant. The vortex (located at the letter V) appears suddenly. Then the configuration 
slowly decays back to a constant. As the coupling increases, the number of lattice 
spacings between the initial constant configuration and the vortex also increases. 

do not resemble each other very much, except at infinitely-strong coupling. Actually, 
Euclidean field configurations which dominate the lattice partition function are not 
isolated vortices, but a superposition of vortices along with spin waves. Most of these 
resemble neither our asymmetrical vortex of Figure 4 nor the more standard variety of 
Figure 5. 

9 The tunneling amplitude in the 0(2) model 

The ultraviolet divergence in the barrier height is due to the fact that the derivative 
da ^' k ) diverges as k — > 1. Any sensible regularization imposes the restriction 



where a is the short- distance cut-off and Q is a constant depending on the details of 
the regularization. For example, on a lattice, a is the lattice spacing and the difference 
in a at adjacent lattice sites is at most tt, so that Q = n/2. 

For the circulating river- valley solution with one soliton, that is (|7.2j ) with N = 1, 




(9.1) 





2K, , \2K 



(x -x ) | < 



2Q 



a 
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Figure 5: The Berezinskii-Kosterlitz-Thouless vortex. The picture of the tunneling 
process is similar, except that it is nearly rotationally invariant. The vortex appears 
halfway between constant configurations during a long time evolution. 



or 



K(k) < 



QL 



The path in orbit space of the tunneling is along the river valley from k = 1* to 
some value of k = k less than 1*, for fixed xq. Recall that the role of the mass is played 
by Cq . The WKB formula for the tunneling amplitude for fixed xq is 



T = exp-^, W= I dk 

Jk 




f3(x, k) 2 dx 



\ 



2 

e 



U(k) U(k) 



e 



e 



(9.2) 



where f3 is the tangent vector to the river valley defined in (7.4). The square root of 
the length of the tangent vector is included in the integrand so that the integration 
measure is the orbit-space line element. 

A useful approximation is the replacement 




dk \ / 1 j3(x, k) 2 dx 



dp{k) = <WR d k 

dk 



in (|9.2|) . This approximation is valid because what remains in the integrand is small 
unless p{k) is small. In this regime, p(k) is given by ( |7.3| ). 

It is convenient to use the formulas for the derivatives of the complete elliptic 
integrals of the first and second kind 



dE 



E-K 
2k 2 



d{k 2 



dK 



E-k' 2 K 
2k 2 k' 2 



d{k 2 
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the approximations valid when k ~ 1 that 



E fts 1 , AT « log — , 

and to make a change of variable from k to if. We also assume that in the dominant 
part of the range of integration U(k) can be neglected. Then ( |9.2|) becomes 

u/ r¥ 2^2 2V2. QL 

W « / — — dif « log . (9.3) 

JK(k) eoK eo a 

Notice that (|9.3j ) implies that the WKB tunneling amplitude at any given point 
vanishes in the thermodynamic limit. The alert reader may have noticed that W is 
similar to the usual expression for the vortex action [[J. 



10 The phase transition in the 0(2) model 

We will now explain the Kosterlitz-Thouless transition using our Hamiltonian quantum- 
mechanical methods. 

Thus far we have considered a change in the winding around the cylinder at a specific 
location. To determine the total transition amplitude it is necessary to sum over all 
the possibilities of this location. On a lattice, the number of such possibilities is L/a. 
With a different regularization, this number should be L divided by the short-distance 
cut-off and multiplied by some (non-universal) constant Y. In any case 

L 2s/2 I Li\ e n 

L T = YQ-— (-) . (10.1) 



a \a 



In the thermodynamic limit L —>■ oo this amplitude vanishes unless eo is greater 
than its critical value 2y / 2 (as pointed out earlier, this number is not universal). For 
small eo, tunnelings are rare and the vacuum wave functional is Gaussian. In this 
weak-coupling phase, winding modes or solitons are stable particles. For sufficiently 
large eo the solitons condense in the vacuum, disordering correlation functions. The 
quantity in the exponential of (11). 1) is really not very different from Kosterlitz and 



Thouless' famous mean-field free-energy estimate. Having said this, we started from a 
different point of view, which will later prove useful for the 0(3) sigma model. 

Many years ago, soliton condensation in the Schrodinger picture was argued to be 



responsible for the phase transition of the Hamiltonian XY model p(J . Our calculation 
shows that this is the case. Our solitons ( |7.7| ), ( [7.8| ) are not classical solutions of the XY 
model, but the associated model (|7.5|) . We will later argue that soliton condensation 
takes place in the 0(3) sigma model for any coupling (at topological angle 9 = 0). 
Once again, these are not solitons of the classical 0(3) sigma model, but are solutions 
of a related classical field theory. 
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The most obvious discrepancy between our result ( |9.3|) and the energy-entropy 
argument of Kosterlitz and Thouless is that we find a critical coupling of eo = 2\/2 = 
2.83 instead of eo = tt/2 = 1.57. This need not concern us, for this number is not 
universal. A recent Monte-Carlo computation |21] for the square-lattice Villain model 
indicates a critical coupling of eo = 1.33. The numerical result certainly disagrees with 
both analytical estimates. 



11 River deltas and solitons in the 0(3) sigma- model 



Next let us turn to the 0(3) case. The extrema of (|6.2|) are related to solutions of the 
spherical pendulum (see for example Whittaker and Appendix B in this article). 

The main new ingredient in the spherical pendulum is the presence of a new con- 
served quantity. This quantity is the angular momentum about the vertical axis. It 
is not possible for the pendulum to reach the zenith or nadir on the sphere unless the 
angular momentum is zero. This is easy to understand physically. If the pendulum is 
placed at either point, the angular momentum about the vertical axis is zero unless 
the velocity diverges; but if the velocity does diverge, the total energy would also be 
divergent. Therefore there is a minimum distance from the zenith and another from 
the nadir to the vertical axis. We have just proved that the motion always lies be- 
tween two circles on the sphere, each circle perpendicular to the vertical axis. Since 
we already know that the vertical component of velocity is periodic in time (in fact an 
elliptic function) it follows that the pendulum actually touches the circles alternately 
and that the time to go from one circle to the other is half the period. It can be proved 
that the azimuthal angle always advances each half-period by A/t, which satisfies the 
Halphen inequality \Ak\ < ti and the Puiseux inequality | Ak| > tt/2 (both of these 
inequalities can be proved using elementary complex analysis [23]). An illustration of 
the pendulum motion is given in Figure 6. 

The configuration which extremizes the functional (|6.2j ) can be written immediately 
once the spherical pendulum solution is known. It is most compactly written in terms 
of Weierstrass functions (our conventions are those of Whittaker and Watson |18| ) 
V[z) = Viz ; 5% gz) and their integrals. The period 2ui\ is purely real, while the period 
2uj 3 is purely imaginary. Explicitly, the analysis of Appendix B gives 

£ 



£,(x) = cos 1 



3 V(y + co 3 ) 



;n.i) 



e 



= ffl q>{(i & + i fe )(K f -i/_)-[c(i/ + )-CW]y} 



X 



a(y + u 3 + v+) cr(y + uj 3 - v_) 
\\ a(y + u 3 - v+) cr(y + uj 3 + ' 

(11.2) 
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Figure 6: The path of the spherical pendulum (shown in red). The motion is restricted 
between two circles (shown in green). The advance in azimuthal angle as the pendulum 
moves from one circle to the other is between 7r/2 and it. This also depicts an extremum 
of (\6.2Q for the 0(3) sigma model, where the role of time is played by the position 
coordinate x. 



where 

2NCU, 

y = —j—{x-x ) ■ 

There are two independent real parameters in this solution, because the constants in 



HID, ( jig) are related by 



4 + 3 ' 4 27 3 ' 

u + = oji + i(3 , z/_ = 27 , 

V(u±) = ~±l, V'(u ± ) = -2tVb, (11.3) 

g2 and g^ are related to &% and oj^ in the standard way and (3 and 7 are real positive 
numbers, which are less than — 2iu 3 . This is the most general solution, but it usually 
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does not satisfy the periodic boundary condition. The expression flll.l ) is periodic in 
£, that is £(L) = £(0), but in general ( |11.2j ) is not periodic in k. 

We note that the invariants #2 and g% are not completely arbitrary, because of the 
inequalities 

ei>l-|, -1 - | < e 3 < e 2 < 1 - | . (11.4) 



From (|11.2 ) it is found that the advance in k during a half-period is 



Ak = k(L) - k(0) = 2iwi[C(i/+) - C(^-)] - 2«7i(^+ - . 

Imposing periodic boundary conditions forces An to be a rational multiple of ir. The 
possibilities include An = ±7r. Then the spherical pendulum solution reduces to that 
of the planar pendulum with the identification sin a = sin £ sin k and cos a = cos£. 
In this case, U[ip] coincides with the 0(2) expression as does p[ip,ip ] (this follows 
immediately from (|4.3| )). 

The results of the previous paragraph show the conclusion of Feynman Jl5| that 



orbits containing configurations of nearly-constant derivatives are a small metric dis- 
tance from ifjQ is false. He considered the distance between orbits containing circulating 
configurations such as ( [7.21) and ipQ (his configurations were not explicit mathemati- 
cally, but he correctly argued that circulating configurations could be made to have 
arbitrarily small potential energy). The distance p circ (k) we have already calculated 
shows that these are far (of order yL) from ^ for lowest potential energy. Feynman 
claimed that a path in orbit space between these two orbits could be made short by 
what he called "slipping the loops about". He was referring to the fact that the first 
homotopy group of the two-sphere is trivial. However, this statement is false because 
the triangle inequality ( p. 11 ) implies any such path has a length greater than p ctrc (k) 



IT]. Nonetheless, there is some merit in Feynman's notion of making paths by "slip- 
ping the loops about". We will show later in this section that most of the important 
tunneling paths are of this type. 

The configurations (|1 1 . 1|) , ( p.1.2 ) for the 0(3) sigma model are related to the soliton 



configurations of a closely related model. Let us consider adding an external source to 



the 0(3) sigma model action (|3.1|) as we did for the 0(2) model in ([7.5|) : 



S = —— / dtdx ( d t s T d t s — d x s T d x s + h T s ) . 
2eo J 

This is no longer the periodic sine-Gordon action, as there are two independent field 
components, instead of one. The direction of h can be chosen so that the action 
becomes 

S = ^- I dtdx {{dtif - (<9,0 2 + sin 2 £ \{d t K) 2 - (O x k) 2 ] + h cosC} , 
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and the equation of motion is 



(d$ -dl)t + {d t sm 2 £d t -d x sm 2 £d x )K= -/isin£ . (11.5) 

Now substituting £(t,x) = ^(m), n(t,x) = n(u) where u = t ± -, as before, into ( |11.5| ) 
yields the equations of motion of the spherical pendulum: 

v 2 h 

dl£(u) + d u (sin 2 £ d u n) = - _ sing , 

The solutions to this equation are a train of solitons, just as in the 0(2) case, with 
spacing 



l32K 2 {v 2 - 1] 



v 2 h 



The difference is that this train is not, in general, a periodic configuration. The soli- 
tons change from one to the next, because of the angular shift 2Ak. Periodic soliton 
solutions do exist when Ak = ir, but these are a small subclass. While ( [T.6|) is a 
completely integrable partial differential equation, we do not expect that ( |11.5| ) is in- 
tegrable. Furthermore, there is no notion of topological stability for the solutions of 
the 0(3) equation ( |11.5| ). 



From the discussion above and Appendix A, it is clear that the 0(2) circulating 
solutions (|7.2[ ) is a local minimum of the potential energy, as is the N = 1 oscillating 
solution for small k, when periodic boundary conditions are imposed. 

It is not sufficient to examine only the 0(2) solutions ( |7.1| ) and (772) embedded in 



the 0(3) model. The reason is that (|1 1 . 1|) and ( |1 1 . 2| ) contain a much larger class of 



highly nonlinear minimal-energy configurations, which should be just as important in 
the thermodynamic limit. How do we impose the boundary condition? We will answer 
this question after examining first some general properties of (|11.1|) and ( |11.2|) . 



For the time being we assume no special boundary condition, but a very large 
system, where cos£ has period I = L/M, which is large compared to the short- distance 
cut-off. Then we can see that integrating over the long distance L should give a 
nonvanishing result for J s^dx/N, but zero for J L s\dx/N and J L S2dx/N. This is an 
ergodicity argument, and the result seems obvious, but is probably difficult to prove. 
Then the distance from ip is given by 

p(N;g 2 , g 3 ) 2 = L- \ [ L co S £dx\ = (1 + |)L - -i— [((2^ + cu 3 ) - CM] 

JO 3 Zl\Ldi 

= K 1 + f-£) <»■"> 
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The potential energy can be written down as an integral: 

tt,*t \ SN 2 u? fL r r £ (2Nuj x x Nil , 

C/(iV; (/ 2 , <? 3 ) = — ^ ^ [(1 + £) - [l + 3 + V ( — ^- + cu 3 jj j cfe . 

This integral can be evaluated and we find 

U{N , M3) ^(1 £ + ^. ,n,) 

We know that for some choice of invariants g 2 and #3, the right-hand side of ( 11.7|) 
will diverge in the ultraviolet, since it must reduce to the 0(2) result when the angular- 
momentum parameter b vanishes. Examining the integral expression for uoi. 

f°° _i 
wi = / [4(2 - ei)(z - e 2 )(z - e 3 )] » ds , 



it is clear that it diverges only when e\ — e 2 — > 0. On the other hand, rji/ui does not 
diverge anywhere except as e\ —>■ 00, in which case ( |11.7 ) vanishes. This can be seen 
by writing this expression in term of Jacobi elliptic integrals as 

^ = -«. + <«.-«)f§, (11-8) 

iO\ K(k) 

where 

k 2 = . (11.9) 

ei - e 3 



In fact, we can also do this for U\. 



Kik) 

Ul= / K J ■ 11.10 

y/ e l ~ e 3 

The relation ( |11.10| ) can be obtained from the identity 

p w = e » + i^^ ir (ii-n) 

and identifying the periods, where k is given by fllPD . Integration of (|TT7TlD and 
substitution of C(^i) — ^1 yields ( |11.8| ). The potential energy diverges logarithmically 
when ei — e 2 — > 0, or A; — *■ 1. It is otherwise finite, vanishing in the thermodynamic 
limit. 

To understand better where £/(iV; g 2 , #3) diverges, we write 
ei = W + <J , e 2 = W-5 , e 3 = -2W . 
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Figure 7: The configurations (11.1), (11.2) with large potential energy are spherical 
pendulum solutions for which the circles begin to close around the vertical axis. As the 
circles close completely, the advance in azimuthal angle approaches tt and the dynamics 
of the spherical pendulum becomes the dynamics of the planar pendulum. 



If 5 < 1/4, the inequalities (|11.4|) become 



2 -±^>W> 2 -^. (11.12) 



These new inequalities are completely consistent with the relations ( |11.3| ). The diver- 



gence occurs as 5 — ► 0. We will compute the angular-momentum parameter b and the 
azimuthal- angle advance An in this limit. 

As 5 vanishes, e%, e 2 and e 3 approach 2/3, 2/3 and —4/3, respectively, by (|11.12|) . 



We also find that £ — > 1. Then V{y±) = — ~ ± 1 implies that V(v + ) =2/3 and 
V{vJ) = —4/3. Hence v + = uj\ and z/_ = u 3 . But then V'{v±) 2 = 0, so that 6 = 0. It 
must therefore be true that a configuration in the orbit becomes an 0(2) extremal-curve 
configuration with k ~ 1. This can be checked by seeing whether 

• \Ak\ is 7r, and 

• p(N; g 2 , g?) approaches zero as e\ — e 2 — > 0. 
Explicitly 

Ak -> 2ia;i[C(o;i) - CM] ~ ^Vii^i ~ w 3 ) = 2^(771^2 - r] 2 ^i) = tt . 
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Figure 8: A two-dimensional river delta where tunneling can occur most readily though 
ultraviolet- divergent energy barriers. The 0(2) river valley passing through the point i[) 
in the orbit-space sphere around ipo is drawn in black, while the remainder of the delta 
is drawn in red. The curve of intersection of the delta and the sphere is drawn in blue. 



From flUJ ), flLOj) and ([1LT2|) 



p(N;g 2 ,g 3 ) 2 - L (l + i - ^ = . 

This shows that as the potential energy diverges, the distance between the config- 
urations ( |1 1 . 1|) , ( |11.2| ) and tpo vanishes. This property was also shown to be true for 
the 0(2) model. What is also significant is that in this limit the configuration becomes 
periodic! There are potential-energy barriers just as in the 0(2) model. In fact, the 
orbits close to ipo become what is essentially the 0(2) configuration for k ~ 1. The 
situation is shown schematically in Figure 7. 

Imagine moving along the barrier from ipQ. As p increases and the energy decreases 
the parameter e\ becomes more important, as it is no longer forced to be 2/3. There is 
now an extremal fan consisting of many adjacent extremal curves (with e\ — e 2 small), 
converging at the point of orbit space ipo- Consider a neighborhood G in the sphere 
around a b = solution ip of the form ( |1 1 . 1| ) , ( 11. 2j ) (this is identical to an 0(2) 



extremal-curve solution). There is a one- dimensional curve containing ip where the fan 
intersects the sphere. The endpoints of this curve are fixed by ([LI. 12 ). For any N, the 



fan contains a circulating 0(2) river valley ( |7. 1| ) . We call the fan a river delta. The 
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potential energy on this river delta is less than on the rest of G. The amplitude for 
barrier penetration is greatest along the delta. The situation is depicted in Figure 8. 

The paths along the river delta are essentially Feynman's "slipping the loops about" 
paths. The 0(2) river valley orbits inside the 0(3) river delta contain very symmetrical 
configurations, which are preserved, up to a sign, under reflection x — > L — x. This is 
not so for the general river-delta configurations. The first homotopy group of 0(3) is 
trivial and it is not hard to visualize these paths as the decay of a non-topologically- 
stable soliton. 

Since the extremal-fan configurations with significant potential energy are periodic, 
we make the approximation that all river-delta configurations can be replaced with 
modified configurations which satisfy periodic boundary conditions. We add an extra 
linear term to k(x), resulting in these boundary conditions. These are not strict minima 
of the potential energy for a given p. However their potential energy is the same as 
that of the corresponding 0(2) extremal-curve configurations within order 1/L. The 
modification we make is to replace ( |11.2j ) by 



e 



'"<*> = exp{(Vi + V2)(v + -V-)-[((v + )-((v-)]y 
+ ^-V + 2[C(f+) - C(v-)\V - 2??iK " v-)v} 



a(y + uj 3 + v + ) a(y + u 3 - v_) 



11.2)' 



It is now permissible to consider such a configuration with any choice of N. The river 
deltas in the model with periodic boundary conditions are paths through configurations 
of the form fliTTD , fllPD 7 . 

12 Tunneling and the topological charge 

Adding the term 

9 r 

S e = 9q = — j dtdx e abc s a d t s b d x s c 



— [ dtdx [<9 4 (cos£ d x n) — d x (cos^ d t K>) 
in J 



Air 

to the action has a significant effect on tunneling through the energy barriers. If 
the spacetime is the two-sphere and the action of the spacetime configuration s(t,x) 
is finite, then the number q is an integer, called the topological charge, and is the 
degree of the mapping from the two-sphere to itself. If the action is not finite, q is 
not quantized and the term "topological charge" is a misnomer. We will show in this 
section that the space-time field configuration generated by moving from ipo through 
the N = 1 river delta has a half-integer value of q. 
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As in the 0(2) model, an ultraviolet cut-off must be introduced. The tunneling for 
N — 1 proceeds as follows: 



1. First a wave packet of the form ( |S.2| ) expands freely from a constant configuration 
in ipo to an N = 1 extremal-fan configuration whose derivatives are limited by 
the cut-off. 

2. Next this river-delta configuration begins to stretch out continuously in space (its 
derivatives with respect to x becoming smaller) as p increases. Eventually the 
configuration becomes a constant (in i/jq) once again. 

Since boundary conditions in x are periodic, Stokes' theorem implies 

1 r L 

q— lim — / cos£d x tz . (12.1) 



ei— e2-*0 All JO 

Instead of doing a detailed calculation of the degree using our explicit configurations 
( |11.2| )', we can see what (|12.1 ) must be from an intuitive argument. The 



azimuthal-angle advance in the initial river-delta configuration must approach An = tt 
as e\ — C2 approaches zero, as we have already shown. This advance takes place over 
extremely short intervals in x. Since N = 1, the advance happens once on the pendulum 
down-swing x = xq + L/2, and again on the up-swing at x = xq. Since k is a compact 
variable, we must subtract 2ir from the advance at the up-swing. As e\ — €2 — > 

k — > ir[H(x — xo) — H(x — xq — L/2)} , 

where H is the step function, namely H(x) = for x < 0, H(x) = 1 for x > 0. Thus 
(H3D is 



1 f L 

q — — / cos^(x) • 7r [S(x — xo) — S(x — x — L/2)] dx 
An Jo 

= ^[cos^(a;o) - cos£(a;o + L/2)] . 

Since 6^0, the pendulum almost swings through the zenith and the nadir, so that 
£(xa) — > and ^(xo + L/2) — > n. The final result is 

9=|[l-(-l)] = |- 

As an orbit moves along the path through the barrier joining ip to itself, the wave 
functional must pick up a phase exp±i#/2. At 9 = n, passage through the barriers 
contribute factors of oscillating sign to the wave functional. This means that the wave 
functional will vanish somewhere along the barrier. Barrier configurations with N = 1 
are suppressed. However, multi-soliton barrier configurations may survive. 
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13 Discussion 



Our Hamiltonian methods, though semiclassical in nature, are very different from the 
saddle-point techniques most field theorists are accustomed to. They are conceptually 
more complicated than saddle-point methods, but are at least partially successful for 
the 0(3) sigma model. We do not know yet whether they can be developed to obtain 
quantitative results or if they might suggest a more powerful version of the saddle-point 
methods. 

We have not evaluated the barrier-penetration amplitude in the 0(3) sigma model. 
This is a formidable problem for several reasons. The regions of barrier penetration in 
orbit space are two-dimensional, unlike the case of the XY model, for which they are 
one-dimensional. More significantly, the task of understanding the role of nonlinear 
spin-wave fluctuations near the barriers will probably not be easy. In the XY model, 
further spin-wave corrections are not important because the spin waves are effectively 
Gaussian. 

Though there is still much we do not understand about the 0(3) model, our analysis 
has been revealing. We have found the explicit form of the barrier configurations. These 
truly have half-integer topological charge, just as predicted by Affleck and Haldane ||. 
However, the one-parameter family of barrier configurations are more general and more 
complicated than both the modified meron in Affleck's paper and the original meron 
of Gross . 

When 9 = 0, the Schrodinger wave functional on configuration space can be signifi- 
cantly different from zero only where the potential energy is small or near a river-delta 
configuration. This is strong evidence that barrier penetration produces the mass gap. 
The fact that the topological charge is half-integer means that at 9 = ir, there can be a 
massless phase. A phase transition may separate this from a massive strong-coupling 
phase, driven by pairs of barrier-penetration events (of vanishing or integer topological 
charge) . 

Feynman's [plj point of view as to how the gap arises was rather different from ours. 
He argued that the mass gap should arise because the vacuum wave functional should 
vanish everywhere on the orbit space, except on a region of finite diameter. This may 
be true, though it is a subtle matter in an asymptotically-free theory where all length 
scales are important. Quantum barrier penetration will occur for configurations other 
than the classical vacuum ipo; one effect of such barrier penetration may be to suppress 
configurations which are a distance p ~ yL from ip . 

We have left many questions unanswered and see three further directions for this 
research. 

While we find the phase transition in the Hamiltonian XY model, our technique 
is not yet powerful enough to fully understand even this model. Our methods are as 
good as vortex-mean-field theory ||, but we are not yet able to do a renormalization 
group analysis, as Kosterlitz first did for the Coulomb gas (or its equivalent, the sine- 
Gordon model). It may be that a Hamiltonian renormalization group of the barrier 
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configurations is possible. 

We would like to understand better the condensation of barrier configurations in 
the 0(3) model. This will require a proper incorporation of spin- wave effects. In 
any case, our results indicate that the instanton gas of integer topological charges 
24] is not sufficient to understand mass generation in the model. The one-parameter 



configurations we have found appear to be more general than merons flTCj. We expect 
that if a calculation could be done by a saddle-point method in Euclidean space, it 
would also reveal this to be the case. 

Finally, the extremal problem for the sigma model ( |6.2j ) has an analogue in gauge 
theories. Some progress has been made towards its solution [p5|], building on some of 



the work here and in reference 13 
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Appendix A: Which extremal curves are minimal 
curves? 

In this appendix we determine which of our extremal curves for the 0(2) model are 
what we call river valleys. These must be minimal curves. We need to know whether 
a point (actually a field configuration or an orbit) on such a curve, constrained to be 
on the sphere (of constant r from So or of constant p from ip ) is in stable classical 
equilibrium. 

Once the situation is understood for the 0(2) model, it is possible to decide the 
issue of river deltas in the 0(3) model, discussed in section 11. 

Consider a solution of ( p-'2\) for the 0(2) case. Suppose that the angle a(x) is an 
extrema curve configurations such as ( |7.1|) or (|7T2|). If we vary this angle by 5a (x), the 
third potential energy changes to second order in 5a by 

f L ( x d * K d W , 
oil = — 5a— -^a H — da— -—da dx 

Jo \ dx 2 2 ax 2 ) 



1 f L ( d 2 \ 

-J Sa ( — ^5a — 2A sin a j dx , (Al) 



where A is the Lagrange multiplier in ( |6.2| ) and we have used the pendulum equation of 
motion. We want to know whether 5U is positive for any acceptable 5a, other than the 
zero mode of translation invariance. We say "acceptable" because 5a is not arbitrary. 
There is a constraint in (|6.2| ) on 5a, namely that the variation must not change the 
distance in orbit space to ip . 
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The constraint means that 5a must satisfy 

' l 1 2 T l 

/ cos(a + 5a) dx + sin(a + 5a) dx 
Jo Jo 



cos a dx 



Expanding this up to second order in 5a gives 



sin a dx 



-vi 



+ v 2 



5a sin a + - {5a) 2 cos a 



5a cos a — -(5a) 2 sin a 



dx + 
dx + 



5a sin a dx 



5a cos a dx 



(A2) 



where 



/ cos a dx , v 2 = / sin a . 
Jo Jo 



The second-order form of the constraint (|A2|) can be simplified somewhat. 

Let us rotate a so that v i = v and v 2 = 0, as is the case for ( |7. 1| ) , (|7.2|) . The same 
conditions can be imposed on a + 5a. The expansion of these conditions to second 
order yields 








5a sin a + - (5a) 2 cos a 
5a cos a — -(5a) 2 sin a 



dx 
dx . 



(A3) 
(A4) 



Conditions ( |A3| ) and ( |A4p are equivalent to ( |A2| ). The variation of the potential energy 
[1|) upon substitution of ( |A3| ) is 



5U = - 



1 /-i 



2 Jo 



5a ( -j—^ + A cos a j 5a 



(A5) 



Substituting ( |7.1| ) and ([7.2[), equation ( |A5| ) becomes 



1 /•£ 

SU= - I <)n 



d 2 IQN 2 K 2 ( ^ nf2 2 4A^is:(x -x ) 
'^ + ^^ - 1 + 2 * S " L ' 



<5a (A6) 



and 



5C/ 



l r L 



2 Jo 



5a 



di 1 AN 2 K 2 k 2 



dx 2 



+ 



L 2 



-1 + 2sn 



,2NK(x-x ) 



5a dx 



(A7) 
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respectively. 

The eigenvalue equation for the operator in square brackets in each of ( AB) and ( |A7[ ) 



is a Hill equation, specifically a Lame equation JTBJ of order one. It is still necessary 
to impose the constraints (|A3|) and (A4). 



Our problem has become the following question: under what circumstances are the 



quadratic forms in (|A6|) and (|A7|) positive on variations satisfying (|A3|) and (|A4|) ? 



After an appropriate rescaling, the Lame operators are 

H osc = H — 1 , (A8) 
on periodic functions of period 4NK, for the oscillating extremal curves (|7. 1|) and 

n circ = n-k\ (A9) 



on periodic functions of period 2NK, for the circulating extremal curves ( |7.2| ), where 
in each case 

d 2 

H = -— + 2k 2 sn 2 u . 
du z 



To answer this question it is first necessary to see whether the spectrum of ( |A8| ) or 
is positive-definite with a single zero mode (corresponding to translation invariance of 
the extremal curve configuration ( |7. 1| ) or ( |7.2| ), respectively). If this is the case for a 



particular Lame operator, any variation raises the potential energy. If not, meaning 
that there is a negative eigenvalue in the spectrum, it must then be checked on a case- 
by-case basis whether variations, other than the zero mode of translation invariance, 
satisfying ( |A3|) and (|A4|) can give a nonpositive 5U. 

The zero mode corresponding to translations for ( |A8|) is cnu and for ( |A"9| ) is dnu, 
which are both doubly-periodic functions. These modes are obtained by simply differ- 
entiating ( |7.1| ) and ( |7.2| ) with respect to x$. Recall that the ground-state eigenfunction 
of a quantum-mechanical Hamiltonian must be a unique (i.e. nondegenerate) real func- 
tion which vanishes nowhere in the physical region of u. Every other real eigenfunction 
must have at least one node. In fact if a given eigenfunction possesses no zeros in the 
physical region it must be the ground-state eigenfunction. This eigenfunction is dnu 
(which satisfies the boundary conditions for ( |A"8] ) and ( |A9| ) ) . 

Let us first consider the case of the oscillating extremal curves, (|A8|) . We can see 
readily that there is a negative eigenvalue in the spectrum. We have shown that the 
ground-state eigenfunction for either the oscillating and the circulating case, is dnu 
which has period 2K and no nodes. Its eigenvalue of 7i OS c is —k' 2 . We will not do 
the analysis here to determine the effect of the constraints (|A3|) , ( |A4]) on SU (though 



it can probably be done, as the spectrum of the order-one Lame with our boundary 
conditions can be completely determined). We will only note that the N = 1 oscillating 
solution ( |7.1| ) for sufficiently small k must be a local minimum on the sphere of radius 
Pq. For there are only three kinds of extremal configurations on the sphere (see figure 

1): 
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• the k < k* oscillating solutions 

• the k > k* oscillating solutions. 

• the circulating solutions ( |7.2| ), if p Circ (0) < po- 

If po is sufficiently small, a configuration of the third type is a barrier configuration. 
In that case, among the three possibilities, the N = 1 oscillating solution with k < k* 
has the smallest potential energy. Since the potential energy is bounded below, it must 
have a minimum value on the sphere. 



For the circulating extremal curves (|A9 ), the ground state dnu is also the zero 



mode, so the remainder of the spectrum is positive. Hence all the circulating extremal 
curves are river valleys. 



Appendix B: The spherical pendulum 

The Lagrangian of the spherical pendulum is 

£ = ^ 2 + ^sin 2 £«: 2 -/i(l-cosO . 

The equation of motion for k is the statement that the vertical component of the 
angular-momentum vector is conserved, that is 

^ = 0, l z = sm 2 £k. (Bl) 

The conserved energy is 

E pend = + + Ml - cos . (B2) 

i — i 2 
Define the new time coordinate r = J and the new conserved quantities b = ^- and 

£ = E pen d/ P — 1- The energy-conservation relation, ( |B2| ) becomes 

\dr ) v ' sm 2 £ 

Let Z = — cos£. The energy-conservation equation is then 

^fpj = 4Z 3 - AEZ 2 -AZ + 4(£ - 6) = M{Z) . (B3) 

The physical region is, by definition, Z G [—1,1]. Since (dZ/dr) 2 must be positive 
in this region, the cubic polynomial M(Z) on the left-hand side of ( [B3D must have 
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some positive values in this region. If Z = ±1, then M(Z) has the value —46. Hence 
the polynomial has at least two real roots between —1 and 1. The motion of the 
pendulum is such that the value of Z is between these two roots. Furthermore, if ( |B3| ) 
is continued outside of the physical region, in the limit as Z — > oo, the behavior of 
M(Z) is M(Z) — > oo. Therefore there is one (unphysical) root of M(Z) in the open 
interval Z 6 (1, oo). 

After an appropriate shift of Z by — £/3, eliminating the quadratic term in the 
polynomial M(Z), it is straightforward to solve (|B3| ) in terms of the Weierstrass elliptic 
function V(z; g 2l g%) = Viz). The solution is 

Z = V[r- r ) + £ 



3 



where 



92 = 4 ( 1 + Tj ' 9 > = 4 {W-T + b ) ■ 

We can parametrize the Weierstrass function by e%, e 2 and e 3 instead of g 2 and ^3. 
These numbers are just the roots of the polynomial on the right-hand side of ( |B3| ) 
shifted by —S/3. They are real, add up to zero and satisfy 

EE E 
ei > 1 - - , -1 - - < e 3 < e 2 < 1 - - . 

Since e\ is positive, is negative, while e 2 can be of either sign. The motion of the 
pendulum always lies between two circles on the sphere. The pendulum reaches the 
bottom circle at time r such that V[j — t ) = e 3 and the top circle at time r such that 
"P(t — t ) = e 2 . If the initial condition at r = is chosen so that Z lies on the bottom 
circle, then — r is fixed to be U3. 
Notice that 



/ (At 3 - g 2 t - g 3 y 1/2 dt 



Ml 

is purely real, while 

^3 = ~i f 3 (93 + 92t - At 3 )~^ 2 dt 

J —00 



is purely imaginary. 

Equation flBll) can now be written 



dK = 2Vb 

dr 1- [p( r + cu 3 ) + f] 2 ' 1 J 
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To integrate ( |B4j ) requires two steps. The first is to reduce the denominator, which is 
quadratic in Vir + 003) to the sum of terms with denominators linear in this function, 
that is 

1 1111 



l-(P + f) 2 ' 2P+f + l 2P + f 
To integrate such an expression, we prove the identity 

V'(v) 



V{z)-V{v) 



2((v) + ((z-v)-az + v). (B5) 



We will show that each of the two sides of ( p5[) has the same zeros and poles (and 
multiplicities thereof) with the same residues of the poles. Equation ( |B5l) then follows. 
Since ((z + 2ui) = ((z) + 2r]i, the right-hand side is an elliptic function. The zeros 
of the right-hand side of (|B5|) are points congruent to zero, since the Weierstrass zeta 
function is odd. Furthermore, the order of each of these zeros is two, for as z — > 0, 

2C(f ) + C(* C(z + -z[V(-u) - V{u)} + 0(z 2 ) = 0(z 2 ) . 

The poles of the right-hand side are points congruent to z = ±z/, since ((z) has poles 
congruent to zero. These are simple poles. We therefore have in each period cell one 
zero of order two and two poles of order one. The poles have residue ±1 at ±z/. Now 
the left-hand side of ( p5|) has double zeros at the double poles of V{z)\ these are the 
points congruent to zero. Furthermore, it has simple poles at the points congruent to 
±u, with residue 

V'iv) 

lim ^7 x L, ^ (Z±V)=±1. 

Therefore ( p5|) is correct. 

The right-hand side of (IBBf) is easy to integrate, giving the result 



/ 



dz 1 r . . cr(z-v) 

■[2z({u) +log- 



P(z)-V(u) V'{yY ^ y ' * a{z + uy ' 
Define two complex numbers v + and z/_ by 

V{v ± )=~±l. (B6) 

These numbers are determined up to 2mui + 2nu^ as well as an overall sign. Now 

V\V-\z)) = (4z 3 -g 2 z-g 3 ) 1 / 2 , 
and substituting ( p6|) gives 

V\v ± f = -46 . 
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Since V is an odd function, we have the option of choosing 

V'{u±) = -2iVb. 

This choice is satisfied by the expressions for v± evaluated below. 
Inverting ( |B6| ) gives 



v± = u 3 + 



dz 



l±i J A (z - ex) (z - e 2 ) (z - e 3 ) 



(B7) 



We can immediately see from (|B7|) and — ~ — 1 < e 3 < — | + 1 that z/„ is purely 
imaginary. On the other hand, v + has a nonzero real part. Explicitly, 



v + = u 3 



f±i 



dz 



A(z-ei) (z-e 2 ) (z - e 3 ) 



^3 + 



-1+1 



dz 



y/-4: (d - 2) (z - e 2 ) (2 - e 3 ) 



3 yj4( ei -z) (e 2 -z) (z-e 3 ) 



(B8) 



The first term of (|B8|) is purely imaginary. So is the second; this follows from e± > 



— I + 1. The last term is oo 2 — oo 3 = uoi + 2u 2 . We therefore have (adding —2oo 2 to 
which changes nothing) 



v + = uii + i(3 , v_ 
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where j3 and 7 are real constants between zero and —2iu 3 . 
The solution of (1B~4T) can now be written down. It is 



i k(x) 



exp{(r/i + j/2)(z/+ - v-) - [((v + ) - C(v-)]t} 



— v_ 



cr(r + lu 3 + ^+) cr(r + uj 3 
\ cr(r + co> 3 - cr(r + u; 3 + z/_) 
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